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ABSTRACT: A multisite distance-based fluorescence resonance energy-transfer assay system was developed 
for the study of protein folding reactions. Single- and double-cysteine substitution mutagenesis was utilized 
to place sulfhydryl residues throughout the tertiary structure of the bidomain enzyme yeast phosphoglycerate 
kinase (PGK). These reactive cysteines were covalently modified with extrinsic donor [5-[[2-(2- 
iodoacetamido)ethyl]amino]-l-naphthalenesulfonic acid] and acceptor (5-iodoacetamidofluorescein) fluo- 
rescent labels. A detailed experimental strategy was followed, which revealed that, when these relatively 
large extrinsic fluorescent labels are covalently attached to properly selected solvent-exposed residues, 
they do not affect the intrinsic stabihty of the protein. The PGK crystal structure was combined with 
molecular dynamics simulations of the dyes built into the protein and time-resolved anisotropy experiments, 
in order to estimate a more realistic orientation factor, (/c^)*, for each donor/acceptor pair. Time-resolved 
and steady-state fluorescence energy-transfer e^eriments revealed that this distance assay, spanning six 
different donor- acceptor distances, is linear and accurate (to within 10—20%) over the range of 30-70 
A, This distance assay system for PGK allows for the measurement of long-range changes in intra- and 
interdomain spatial organization during protein folding reactions. The approach which we have developed 
can be applied to any protein system in which unique one- and two-site cysteine residues can be engineered 
into a protein. In the following paper [Lillo, M. P., et al. (1997) Biochemistry 36, 1 1273-11281], these 
multisite energy-transfer pairs are utilized for stopped-flow xmfolding studies. 



The use of fluorescence spectroscopy to study protein 
folding kinetics has had a very long and successful history. 
This success stemmed primarily from the fact that fluores- 
cence signals are often very sensitive to the folded state of 
proteins and measurements can be made with good intrinsic 
timing resolution. With this extreme sensitivity, fluorescence 
folding studies have generated an abundance of high-quality 
data with complex multiphasic kinetic patterns. One serious 
problem, however, has plagued all of these studies. How 
was one to interpret this complex change in (typically) an 
integrated total-intensity signal in terms of fundamental 
physical properties associated with folding intermediates? 
Given a simple steady-state fluorescence signal, no funda- 
mental interpretation is really possible. Therefore, applica- 
tion of fluorescence spectroscopy to the protein folding 
problem was in a fundamental conimdrum. What was the 
use of being able to measure a fluorescence signal change 
with high accuracy and timing resolution if you cannot 
interpret the observed signal changes in terms of any 
fundamental property of the folding intermediates? 
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The goal of the experiments described in this study was 
to develop a fluorescence assay for protein folding in which 
the signal changes could be direcfly translated into changes 
in intramolecular distances. To achieve this goal, a multisite 
fluorescence energy-transfer capability was engineered into 
a single-protein system, yeast phosphoglycerate kinase 
(PGK),* xjsing cysteine substitution mutagenesis. Due to 
many intrinsic problems associated with performing fluo- 
rescence energy-transfer measurements, extra precautions 
were taken in order to ensure that the "optical distance" assay 
was working properly. For each pair of donor/acceptor sites, 
a maximimi of nine mutant proteins were examined, in terms 
of both biological stability/activity and photophysical proper- 
ties. This highly developed energy-transfer distance assay 
system for PGK has the potential for illuminating aspects 



' Abbreviations: D-PGK, donor-labeled protein; D-PGK-A, donor- 
aiid acceptor- labeled protein; CM, center of mass of tlie emission 
spectrum; FRET, Forster resonance energy transfer, fwhm, full widtli 
at half-maximum probabilit>'; lAEDANS, 5-[([(2-iodoacctyl)amino]- 
ethyl]amino]naphthalenesulfonic acid; AEDANS, reacted lAEDANS 
covalently attached to protein; lAF, 5-iodoacetamidofluorescein; AF, 
reacted TAF covalently attached to protein; GuHCI, guanidine hydro- 
chloride; MD, molecular dynamic; MOPS, 3-{A^morpholino)propane- 
sulfonic acid; HEPES, A'-{2-hydroxyethyljpipera2ine-j\r-2-ethanesulfon- 
ic acid; EDTA, ethylenediaminetetraacetic acid; DTT, DL-dithiothreitol; 
PGK, phosphoglycerate kinase from Saccharomyces cerevvnae; Trp, 
tryptophan; {kP}*^ orientation factor which takes into account protein 
crystal structure, steric constraints, and measured rotational dynamics 
of the dyes. 
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of the protein folding problem which may not be obtainable 
in any other manner (e.g., real-time measurement of long- 
range changes in intra- and interdomain organization). The 
approach which we have developed can be applied to any 
protein system in which unique one- and two-site cysteine 
residues can be engineered into a protein, 

MATERIALS AND METHODS 

1,5-IAEDANS and 5-IAF were purchased from Molecxilar 
Probes, Inc. (Eugene, OR). DE-52 cellulose was from 
Whatman (Hillsboro, MA) and Ultrogel AcA202 from IBF 
Biotechnics (Columbia, MD). MOPS, HEPES, DTT, and 
substrates for PGK activity assays were from Sigma Chemi- 
cal Co. (St. Louis, MO). Guanidine hydrochloride (sequanal 
grade) was from Pierce (Rockford, IL). The protein assay 
kit was purchased from Bio-Rad Labs (Hercules, CA), and 
Centricon-30 microconcentrators were purchased from Ami- 
con, Inc. (Beverly, MA). 

Protein Purification and Characterization. Single- and 
double-cysteine mutants of yeast PGK were constructed using 
site -specific mutagenesis of yeast PGK cDNA in which the 
single cysteine at position 97 had been previously mutated 
to a serine, C97S (Sherman et al, 1990; Haran et al., 1992). 
In addition to two single-cysteine mutants, Q135C and 
S290C, and a double-cysteine mutant, Q135C/S290C, which 
were described previously (Haran et al., 1992), three ad- 
ditional single-cysteine mutants, S75C, T202C, and S412C, 
and five double-cysteine mutants, Q135C/S412C, S75C/ 
S412C, T202C/S412C, S75C/S290C, and S290C/S412C, 
were constructed for this study. Each mutated gene was 
sequenced to confirm the presence of only the desired 
mutations (Chen & Seeburg, 1985). 

PGK mutants were expressed in the Saccharomyces 
cerevisiae XSB44-35D strain, which lacks PGK, and purified 
as previously described (Haran et al., 1992). Specific 
activities of the mutants were determined as described 
previously (Mas et al., 1986). Concentrations of unlabeled 
proteins were determined spectrophotometrically at 280 nm, 
using an absorbance for a 1 mg/mL solution of 0.49, 
determined for wild-type (WT) PGK using the method of 
Gill and von Hippel (1 989). Specific activities of the single- 
and double-cysteine mutants were within the range of the 
specific activities of WT PGK preparations. 

Protein Labeling with lAEDANS and lAF. Covalent 
modification of single-cysteine mutants with lAEDANS and/ 
or LAF was carried out as described previously (Haran et 
al,, 1992), with several modifications. Protein samples (2—3 
mg/mL) in 50 mM HEPES buffer (pH 8.0) and 1 mM EDTA 
were incubated at 4 **C for 1 h with a 20-fold molar excess 
of lAEDANS or for 4 h with a 40-fold molar excess of lAF. 
The reactions were carried out in the dark, with magnetic 
stirring. A 1 0-fold molar excess (over probe concentration) 
of DTT was added, and incubation was continued for 1 h to 
terminate the protein modification reaction. The labeled 
samples were separated from free probes by gel filtration 
on an Ultrogel AcA202 column, equilibrated with the above 
buffer containing 1 mM DTT. For samples labeled with lAF, 
extensive dialysis against the same buffer was necessary 
following the column filtration step to completely remove 
free lAF. 

Modification of double-cysteine (Cys-PGK-Cys) mutants 
was carried out in two steps. First, the sample was incubated 



with lAEDANS under conditions which led to partial 
labeling. The resulting reaction mixture contained unlabeled 
PGK (Cys-PGK-Cys) and singly labeled, Cys-PGK-D and 
D-PGK-Cys, and doubly labeled protein, D-PGK-D. The 
conditions of the chemical modification were the same as 
those for the single-cysteine mutants described above, except 
for the incubation time, which was reduced from 1 h to 20 
min. The species with one cysteine labeled with AEDANS 
and the second cysteine free (D-PGK-Cys and/or Cys-PGK- 
D) were separated from the other components using an anion 
exchange DE-52 cellulose (Whatman, Hillsboro, OR) column 
(12 X 1.5 cm) equilibrated with 10 mM MOPS (pH 7.5) 
and 1 mM EDTA at 4 **C (see Results and Discussion) and 
then labeled with lAF to obtain doubly labeled species (D- 
PGK-A and A-PGK-D). The labeling of D-PGK-Cys with 
lAF was performed in 10 mM MOPS buffer (pH 7.5) 
containing 1 mM EDTA and 1 mM DTT, as described above 
for single-cysteine mutants. 

The protein concentration before labeling was determined 
spectrophotometrically at 280 nm, as described above. 
Concentrations of labeled protein samples were determined 
by the method of Bradford (1976), using wild-type PGK to 
prepare standard curves. After the first step, the specific 
activities of the labeled PGK mutants were decreased by 10— 
15% compared to those of the vmlabeled proteins, and by an 
additional 10—1 5% following the second reaction step. The 
net activity loss for the fmal samples was similar to the 
activity loss experienced by WT PGK when subjected to 
identical procedures (25—30%). The stabihty of the labeled 
samples was not significantly changed (see Results and 
Discussion). 

The extent of labeling with lAEDANS and lAF was 
determined spectrophotometrically, using molar extinction 
coefficients of 4300 M"* cm"* (±450) at 337 nm and 61 000 
M"* cm"* (±4000) at 492 nm, respectively. These extinction 
coefficients (and their corresponding error estimates) repre- 
sent the average extinction coefficients of PGK-bound 
AEDANS and AF determined spectrophotometrically for 
several {n ^ 25) pure preparations of single- and double- 
cysteine mutants covalentiy labeled with each probe. Cen- 
tric on-30 microconcentrators were used to concentrate protein 
samples and to exchange buffer to 50 mM MOPS (pH 7.5) 
containing 1 00 mM NaCl and 1 mM DTT, for the fluores- 
cence measurements. 

Steady-State Fluorescence Emission Spectra. Corrected 
steady-state emission spectra were obtained with a SPEX 
1681 Fluorolog spectrofluorometer (Edison, NJ), equipped 
with a 450 W xenon arc hmp. AEDANS and AF fluores- 
cence emission were collected from 400 to 700 nm by 
exciting the donor and donor- and acceptor-labeled proteins 
at 340 nm. Acceptor emission was collected from 465 to 
700 nm, by exciting directly AF at 440 or 460 run in the 
donor/acceptor samples. Tryptophan emission spectra were 
excited at 295 nm, with emission collected from 300 to 410 
nm. The excitation and emission band width were 1 —2 nm 
for all the spectra. Background fluorescence from unlabeled 
protein, buffer, and denaturant was recorded and subtracted 
from the protein spectra. 

Steady-state measurements were performed in a 150 //L 
fluorescence cuvette with 10 mm (excitation) and 2 mm 
(emission) path lengths. The absorbance at the excitation 
wavelengtfi was always lower than 0.03 (10 mm path length), 
and the absorbance at the maximum of the absorption spectra 
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of fluorescein in the donor/acceptor sany>les was lower than 
0.1. The temperature of the measurements was 25 ± 0.1 

Time-Resolved Fluorescence Measurements. Time-re- 
solved fluorescence measurements were performed using a 
frequency-doubled Coherent Antares Nd:YAG laser (Palo 
Alto, CA) synchronously puirq)ing a dye laser (Coherent 702) 
using kiton red (Exiton, Dayton, OH) for AEDANS and AF 
at 325 nm. A laser repetition rate of 4 MHz was utiHzed, 
and the pulse width was approximately 1 ps. A half-wave 
plate in the excitation beam was utilized to rotate the 
excitation polarization to horizontal for the determination of 
the polarization bias {G factor) of the detection instrumenta- 
tion. The fluorescence emission signal was passed through 
a Glan-Thompson polarizer and a Hoya (Fremont, CA) 420 
cut-on filter and focused into a SPEX 0.22 m emission 
monochromator set at 450 nm (donor emission studies) or 
560 nm (acceptor emission studies), with a band width of 
2-4 nm, and detected by a 6 jxm microchannel plate detector 
(Hamamatsu R2809U-01, Bridgewater, NJ) operating in the 
single-photon-counting, time-correlated mode (Bloom et aL, 
1994; Jones et al., 1995). The impulse response functions 
were typically 40— 80 ps at fwhm. Timing calibration was 
6—12 ps/chaimel, with 5000—8000 channels of data obtained. 
Typical total intensity sum curves had approximately 10 000 
counts in the peak channel (?«30 x 10* total photons). Time- 
resolved measurements were performed on 120 fih samples 
in cuvettes with a 3 x 3 mm path length. Protein solutions 
(1 -4 fjM) were prepared in 50 mM MOPS, 100 mM NaCl, 
1 mM EDTA, and 1 mM DTT, at pH 7.5. The temperature 
of the measurements was 25 ± 0.2 °C. During acquisitions, 
the emission polarizer rotated from vertical to horizontal 
every 30 s to measure the parallel (||) and perpendicular (1) 
components of the emission. The thermostated sample 
holder was rotated from the donor-only cuvette to the donor/ 
acceptor cuvette every 60 s (ISS Koala unit, Urbana, IL). 

Data Analysis. Time-resolved fluorescence anisotropy 
decays were used to study internal rotational motions of the 
probes attached to the protein. Simultaneous analyses of the 
parallel and perpendicular intensity decays were performed 
using Globals Unlimited (Urbana, IL) as described in 
Beechem et al. (1991). Anisotropy decays, r(/), were found 
to be adequately described as a biexponential function: 



r{i) = /5f exp(-/%) + p. exp(-//0g) 



(1) 



where fix and are the amplitudes of the fast local probe 
motion (0f) and overall global protein motion (0g), respec- 
tively. Within the context of an orientation factor calculation 
(see below), )3g represents the limiting anisotropy (i.e., the 
long-time asymptote) for the fast local motion of the probe 
OSf). A nonassociative anisotropy decay model [e.g., Beechem 
et al. (1991)] was utilized for all of the labeled proteins. The 
goodness of the fit was judged by examination of randomness 
in the residual distribution and recovered global % square 

Equilibrium Unfolding Studies Monitored by Steady-State 
Fluorescence. All of the site-specific energy-transfer pairs 
have been designed in order to examine changes in distances 
during equiUbrium and kinetic folding studies. Since experi- 
ments need to be performed under both native and unfolded 
conditions, the number (and importance) of the "donor-only" 
controls are crucial for obtaining meaningful energy-transfer 



results. Since the final resonance energy-transfer efficiencies 
are calculated as the ratio of two different protein samples 
(donor-only and donor/acceptor pair), one has to ensure that 
both samples have identical (or nearly identical) stabilities. 
The unfolding transitions for D-PGK and D-PGK-A sanq)les 
were monitored using fluorescence (steady-state and time- 
resolved) techniques with guanidinium hydrochloride (Gu- 
H CI) as a denaturant. 

. For some measurements, it was foimd useful to quantitate 
folding transitions using the measured center of mass, CM 
(eq 2), of the emission spectra: 



CM = 



(2) 



where F(A,) is the fluorescence intensity measured at the 
emission wavelength A/. Changes in the center of mass of 
the emission spectra as well as fluorescence total intensity 
of AEDANS in D-PGK (Aex = 340 nm), AF in D-PGK-A 
(Aex = 460 nm), and tryptophan (k^x — 295 nm) in both 
D-PGK and D-PGK-A samples were utilized to monitor 
guanidine-induced transitions. Unfolding transitions were 
also examined using the energy-transfer efficiencies. 

The protein samples (series of protein solutions at identical 
protein concentration) were incubated at each GuHCl 
concentration (from 0 to 2 M) for a minimum of 3 h prior 
to measurement to assure that equilibrium conditions had 
been reached [the proteins are completely iinfolded at 2 M 
GuHCl (Szpikowska et al., 1994)]. The GuHCl concentra- 
tion of the stock solutions was determined using the refractive 
index of Pace et al. (1989) and an empirical equation 
(Nozaki, 1 972). Folding reversibility of aU the labeled PGK 
samples was examined by dilution of the unfolded samples 
fi-om 1.5 to 0.15 M GuHCl (MOPS buffer). AU the 
preparations contained 1 mM DTT. 

The transition midpoints and cooperativity indexes were 
calculated by fitting the experimental data (using a nonlinear 
least-squares algorithm, Sigma Plot software, San Rafael, 
CA) to the equation derived from the denaturant binding 
model as described by Szpikowska et al. (1994). This 
method of analysis was selected to allow for a direct 
con:q>arison of the stability measured for wild-type PGK and 
those of all the variously labeled mutants. 

The unfolding curves were normalized to the apparent 
fi-action of the unfolded form: 



'="(u)=(>;b.,- Wu->'p) 



(3) 



where yobs is the observed physical parameter (fluorescence) 
at a given denaturant concentration and Tp and Tu are values 
of the r characteristic of the folded and unfolded conforma- 
tions, respectively, at the same denaturant concentration. 

The baseline correction was done, prior to normalization, 
using linear extrapolation of the baselines in the pre- and 
post-transition regions to obtain estimates of and Y\j in 
the transition region (Pace et al., 1989; Szpikowska et al., 
1994), 

Measurements of Energy Transfer. The Forster distance, 
/?o, is the dye-to-dye separation distance for 50% efficiency 
of resonance energy transfer. This constant is calculated (in 
angstroms) using the spectroscopic properties of the donor 
and acceptor: 
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R,= Q.2n{n-'Q^4'^ (4) 

A refractive index («) of 1 .4 was used for all protein samples 
(?iBUFFER = 1.337, n2MGuHa = 1.375, and /iprotein ^ 1-4). 
The fluorescence quantum yield of the donor samples (Qd) 
was determined for each position by reference to a solution 
of quinine bisulfate in 0.1 N H2SO4, using a quantum yield 
for quinine of 0.51 (Velapoldi, 1972). /(M"^ cm"^ nm'*) is 
the spectral overlap integral between the emission spectrum 
of the donor and the absorption spectrum of the acceptor 
and was approximated by 

[Y/:,{X)e^{X)X'^ky[Yf^{X)iSX\ (5) 

where Fd(A) was the corrected emission spectrum of the 
donor-labeled samples (arbitrary units) and €/J^X) the extinc- 
tion coefficient (M~* cm"*) of the acceptor from the donor/ 
acceptor samples. Acceptor extinction coefficients at A = 
492 nm were determined for all the different labeled 
preparations and found to be nearly identical (e'*'^ = 61 000 
M-> cm-i). 

The orientation factor (hP) depends on the relative orienta- 
tion of the transition dipoles of the donor and acceptor. In 
the samples described in this work, multiple estimations of 
the orientation factor were performed. The first estimation 
was to calculate the Forster distance Ro, introducing into eq 
4 the isotropic dynamic average value of (k^ = V3; Dale 
& Eisinger, 1975). In addition, steady-state and time- 
resolved fluorescence anisotropies of all the labeled samples 
were examined in order to obtain a more reaHstic estimation 
of the proper orientational averaging. From the measured 
limiting anisotropy for the fast local motion O^g) (refer to eq 
.1), the donor and acceptor depolarization factors {di,^) and 
{d^) were calculated as {df) — (ySg Iri)^^ (Soleillet's theorem; 
Dale et al, 1979). The half-cone angle, 6^, over which the 
probes move, was also estimated using (fig/rfY^ — cos ^c(l 
+ cos 6c)l2 (hindered rotation model; Kinosita et al., 1977). 
The measured anisotropy value at time zero (i.e., + 
eq 1) was found to match the fundamental anisotropy (rf) 
previously reported for both of these probes (donor rf = 0.20 
± 0.02, acceptor rf = -0.14 ± 0.02) (Hudson & Weber, 
1973; Chen & Bowman, 1965). Applying averaging analy- 
sis, we estimated the maximum and minimum (k^) values 
(eqs 6 and 7) from the measured depolarization factors (Dale 
el al., 1979): 

('^'>nun = %[l-((0 + <0)/2] (7) 

and are the Forster distances derived from the 
minimum and maximum values of from the relationship 

j^maxfum = Q 5^^2ynaxAnm)l/6^(2/^) p^j. unfoldcd SampleS, 

depolarization of the fluorescence anisotropy was essentially 
complete before emission (see Table 3), allowing the use of 

Looking for a Better Estimation of </c^)* The above (k^) 
(min/max) calcxilation, however, does not make use of the 
crystal structure of PGK and the known site locations of the 
inserted cysteines. For instance, the (k^) (min/max) calcula- 
tion above would allow the dye probe to sample confonna- 
tions which would clearly not be allowed due to simple steric 
constraints. To obtain a (tc^) calculation wiiich was consistent 



with the known crystal structure, the donor and acceptor 
probes were built into the crystal structure (Brookhaven 
Protein Data Bank file 3PGK; Watson et al., 1982) so that 
a "sterically appropriate" orientational averaging could be 
performed. For the serine — * cysteine mutants (S75C, 
S290C, and S412C), it was relatively easy to replace the 
serine OH with the cysteine SH, maintaining the same 
geometry as in the crystal structure. The probe label was 
then built onto the SH as S-CHi-CO-NH-fluorescein or 
S-CH2-CO-NH-CH2-CH2-NH-naphthalene-l -sulfonic acid. 
For Q135C, the glutamine side chain was replaced with 
cysteine so that the sulfur atom assumed the position of the 
glutamine C-y atom For T202C, the threonine side chain 
was replaced with cysteine so that the sulfur atom assumed 
the position of the threonine 0-y atom. Molecular models 
of AF and AEDANS were constructed from fragments 
supplied with BIOGRAF (Molecular Simulations Inc., 
Waltham, MA) and energy minimized using the DREIDING 
II force field (Mayo et al., 1990). The region of conforma- 
tional space accessible to each probe was evaluated by 
dynamics simulation in which four torsion angles (bonds 
connecting C-o, C-^, S-y, probe methylene C, and probe 
carbonyl C) were allowed to vary randomly and sim\ilta- 
neously (Monte Carlo simulation). Of the 500 conformers 
generated, those with the lowest energy were saved (^^^100 
conformers) and their coordinates averaged to generate an 
estimate of the equilibrium configuration of the emission 
oscillator for the donor and the absorption oscillator for the 
acceptor. The MD calculations allowed all but the sterically 
restricted configurations of the probes to be adopted. In 
theory, the MD simulations could have been utilized to obtain 
not only the "mean configuration" of the probe but also the 
full (k^) by calculating depolarization factors from the 
individual MD trajectories. However, it was felt that the 
measured time-resolved anisotropy provided a much more 
accurate measurement of the probe's motion. Therefore, the 
mean probe configuration from MD was combined with the 
measured time-resolved anisotropy of the probes motion to 
obtain a final formula for (/c^)*: 

<^^)* = MD(0D. 0A, V)<0<0 + VsCl - <0) + 

cosXXO(l-<0) (8) 

where 

MD(0D,^A,V') = 

[sin(^D) sin(0A) cos(V') - 2 cos(^d) cqs(^J]^ (9) 

and ^A> and jp represent the MD-determined average 
angular position of the donor and acceptor dipoles and the 
azimuthal angle connecting the two, respectively [see Dale 
et al. 1979)] and (d:/) and (^/a*) are the measured depolar- 
ization factors from the fluorescence anisotropy decay (see 
above). It is interesting to note that the MD simulations 
revealed the same trend in rotational mobility as that 
measured by the time-resolved anisotropy. 

Steady-State Energy Transfer. The ratio of steady-state 
fluorescence intensities for the donor only (/d) and the donor 
in the presence of the acceptor (/da) (for identical protein 
concentrations of D-PGK and D-PGK-A, Aex = 340 nm, Aem 
— 400-460 nm) is proportional to the average energy- 
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transfer efficiency, as described by eq 1 0: 



(10) 



Two labeled samples, D-PGK and D-PGK-A, were used for 
each set of experiments. The transfer efficiency was also 
calculated from sensitized acceptor emission. 

The measured energy-transfer efficiency was xised to 
calculate the average distance, (R), between the donor and 
acceptor transition dipoles by using eq 11: 



(11) 



Time-Resolved Energy Transfer (Empirical Exponential 
Analysis). Determination of the average energy-transfer 
efficiencies, (£), was also obtained from the ratio of the 
average lifetimes of the D-PGK, (td>, and D-PGK-A, <tda), 
calculated from a nonlinear least-squares multiexponential 
fitting procedure: 



^D(0 = X^yexp(-//TD3 



(12) 



The average donor lifetime, (td), is calculated with the 
equation 



<i^D> = Xa,TD/£a,. 



(13) 



7=1 1=1 



tti and td, are the amplitude and lifetime, respectively of the 
/th component. Three lifetimes are required for adequate 
fitting for the donor-only samples at Aem = 450 nm, and two 
lifetimes are required at Aem = 560 nm. The fluorescence 
decay of the donor/acceptor pair was also empirically 
analyzed by multiexponentials. This analysis gave the 
average (td(a)}. The average energy-transfer efficiency was 
calculated by <£} = 1 - <td(a))/(td): Since {r d(a)) and (td) 
are obtained without the need of absolute protein concentra- 
tions, uncertainties associated with protein concentration 
determination are eliminated by the time-resolved fluores- 
cence measurements. 

Time-Resolved Distance Analyses, Discrete donor— ac- 
ceptor distances were also obtained from direct simultaneous 
analysis of both the donor-only and donor/acceptor decays 
using 



1=1 



' 1 1 
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— \ — 
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(14) 



If the protein has conformational flexibility in solution, the 
interchromophoric distance will not be unique, and eq 14 
will not be able to adequately represent the measured data. 
The observed donor decay in the presence of the acceptor 
will contain multiple contributions with a probability dis- 
tribution function dependent upon the distribution of D- A 
distances, P(R) (Haas et al., 1978, 1988; Haas & Steinberg, 
1984): 



Vj = 7r"^W Za^exp 



1=1 
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N-domain 



135 C-domain 

290 




202 

Figure 1 : Schematic representation of the crystal structure of yeast 
PGK (Brookhaven Protein Data Bank file 3PGK), sliowing the 
position of the residues that were modified (•). Of the six distances 
measured in this study, the 135 290, 75 ** 290, 290 ^ 412, and 
412 -* 202 are interdomain distances and 135 ** 412 and 412 — 
75 are intradomain distances. Residue 412 is only three residues 
apart from the C-terminal end, and it is situated on the N-termiual 
domain in the folded protein. 

i^ni is the maximum D— A distance. In this study, we used 
a Gaussian form for the distribution P{R) to describe the 
shape of the distance distribution: 



P{R) = 



1 



exp 



2<^ 



(16) 



where R is the D— A distance, R^ is the mean distance of the 
calculated distribution (most probable distance), and a is the 
standard deviation of the distribution. We did not attempt 
to resolve the intrasite diffusion coefficient in these analyses 
(Haas & Beechem, 1989). In these global analyses, the 
donor-only lifetimes, td„ were linked between both D-PGK 
and D-PGK-A decay curves. It was found that all of the 
observed dynamic quenching could be adequately represented 
(see Figure 3 for a typical fit) using only the energy-transfer 
distance terms. /?c and o were adjustable parameters in the 
fitting of the donor/acceptor decay data. The and 
residuals distribution were utilized to judge the goodness of 
the fit. 

RESULTS AND DISCUSSION 

The locations of the cysteines engineered into PGK are 
shown in Fig\ire 1. The overall design concept is the 
generation of xinique fluorescence labeling sites by replace- 
ment of one or two selected residues with a cysteine. In 
this work, an initial set of six different energy-transfer pairs 
was characterized in terms of both site-specific photophysical 
properties and enzyme stability. Although it is possible (in 
theory) to make an energy-transfer measurement with only 
two labeled protein samples (donor only and donor/acceptor 
pair), we decided to take a more rigorous approach to 
characterizing the energy-transfer pairs in this system. To 
generate a completely defined "distance assay" between any 
single site on PGK, as many as nine proteins were examined. 
This experimental design strategy included a range of extra 
controls which are important when using fluorescence 
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Table 1: Summary of the Labeled Proteins Examined for the Photophysical Characterization of Each Energy-Transfer Pair Cysj -* CysJ 



sample 


name 


Cys/ — Cysy 


no. of 
cysteines 


fluorophore 


donor only (D-PGK) 
acceptor only 

donor-acceptor (D-PGK-A) 


i-single cysteine 
7-single cysteine 
i-two cysteines 
y-two cysteines 
('\/)-two cysteine average 
(i,j)-cwo cysteine "double donor" 
/-single cysteine 
y-single cysteine 
iyj specific label 
/ 1 specific label 
/,y average label 


D--- 
---D 
D- - -Cys 
Cys- - -D 

D- - -Cys + Cys- - -D 

D---D 

A--- 

---A 

D — A 

A*-D 

D— A+ A— D 


1(7) 
2(Uj) 

2 a J) 

2(1. J) 

Uf) 

HJ) 

2{iJ) 

2(/,y) 

2 a. J) 


AEDANS (/) 
AEDANS (t) 
AEDANS if) 
AEDANS (j) 

AEDANS (ij + AEDANS {J) 
AEDANS (/,y) 
AF (i) 

AEDANS (i j and AF {J) 
AEDANS (/|andAF(0 
AEDANS (i) and AF (J) and + 



energy transfer to monitor protein folding reactions. Table 
1 summarizes the various modified proteins which have been 
examined for each energy-transfer pair. It is important to 
understand the symbols in Table 1 . For instance, fluores- 
cence energy transfer between sites / and J requires cysteines 
at both the / and J sites. To "force" the donor label 
(lAEDANS) exclusively to site /, the single-cysteine mutant 
is examined. Similarly, to force the donor to site y, the single- 
cysteine mutant at site J is examined. These single-site 
mutants are extremely useful in unequivocally characterizing 
the photophysical characteristics of each individual labeling 
site (i.e., rotational mobility, lifetimes, emission spectra, etc.). 
These single-site mutants, however, are not particularly well 
suited for serving as donor-only or acceptor-only controls 
because the double-cysteine mutants mxist be used for all of 
the actual fluorescence energy-transfer measurements. There- 
fore, additional spectroscopic studies were performed on the 
double-cysteine mutants. 

In order to be able to individually characterize the various 
labeled protein species in the double-cysteine mutants, it is 
important to be able to chemically separate each labeled 
species (refer to the multiple two-cysteine-labeled proteins 
in Table 1). For the labeled PGK proteins, it was possible 
to separate many (in some cases all) of the singly and doubly 
labeled proteins from each other. For the double-cysteine- 
containing proteins, a two-stage reaction and purification 
procedure was utilized. 

Reaction vnth lAEDANS (donor) generates a total of four 
protein species: Cys-PGK-C^s (unlabeled protein), D-PGK- 
Cys, Cys-PGK-D, and D-PGK-D. To separate these multiple 
labeled species, anion exchange chromatography was utilized. 
Figure 2 A depicts a chromatographic profile for the double- 
cysteine mutant Cysl35/(3ys412, obtained from the initial 
reaction mixture with lAEDANS. The peaks in this elution 
profile were identified by comparing their elution with the 
elution of the controls: single-cysteine mutants specifically 
labeled at each site, the unlabeled protein, and the exhaus- 
tively labeled double-cysteine mutant. Peaks I and III 
(Figure 2A) were identified as the unlabeled and doubly 
labeled proteins, respectively, and peaks IIA and IIB 
represented D-PGK-(3ys412 and Cysl35-PGK-D, respec- 
tively. Being able to isolate each of the singly labeled donor 
species (in this case) allowed the measurement of distances 
from botii the specific and the average donor/acceptor 
labeling positions (see Table 1). The extent of separation 
of two singly labeled species (D-PGK-Cys and Cys-PGK- 
D) depends on the location of the probe. The elution volume 
for the singly labeled mutants (peak I) increases in the 
following order: C75 < C135 < C202 < C290 < C412. 
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Figure 2: Chromatographic separation of D-PGK and D-PGK-A 
for the C135-C412 mutant. Double-cysteine mutants were labeled 
with T AEDANS (D) and TAP (A) in a two-step procedure as 
described in the Materials and Methods. Labeled proteins were 
separated on a DE-52 cellulose column (12 x 1 .5 cm) with a linear 
salt gradient (0 0,25 M KClJ. Two milliliter fractions were 
collected at a flow rate of 20 mL/h. Shown in panel A is the 
separation of the mixture of labeled and unlabeled species following 
labeling of the C135-C412 PGK mutant with lAEDANS. In this 
case, the singly D-labeled species modified at positions 135 and 
412 eluted as separate peaks (peaks ELA and HB, respectively). Peaks 
I and HI represent im labeled and doubly labeled protein, respec- 
tively. In some experiments, peaks TTA and TIB were pooled together 
prior to lAF labeling. In one experiment, only the 4 12- 1 35 mutant 
labeled with AEDANS at C4 12 (peak IIB in panel A) was subjected 
to modification with LAF to produce site-specifically labeled protein 
with D at position 412 and A at position 135 (denoted 412 135). 
Panel B shows separation of D412-PGK-A135 (peak IT) from the 
remaining D4 1 2-PGK-SH. 

Once the singly labeled donor species were purified, these 
proteins were farther reacted with lAF. Purification of these 
labeled proteins was much easier, because only two protein 
species are present: D-PGK-Cys and D-PGK-A. A chro- 
matography profile of the acceptor-reacted protein using peak 
IIB as starting material (in Figxire 2A) is shown in Figure 
2B. Comparison with the elution profile of the D-labeled 
single-cysteine control proteins and examination of the ratio 
of absorbance at 280 and 492 nm confirmed that peak I was 
D-PGK-Cys and peak II was D-PGK-A. In all cases, the 
large chromatographic separation distance between the 
elution pattern of D-PGK-A and D-PGK-SH protein peaks 
eliminated any need for spectroscopic corrections associated 
with incomplete labeling of the D-PGK-SH samples with 
the acceptor. 

This biochemical separation of the various donor/acceptor 
species allowed for the photophysical characterization of 
many of the protein species described in Table 1 . These 
samples were also characterized for folding properties (Figure 
6 and Table 2) and enzyme activity. Only mutants which 
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Table 2: Folding Transition Midpoints for All Six D-PGK and 
D-PGK-A Energy-Transfer Pairs 



energy-transfer pair 


Cn (M) (tryptophan/ 


C„(Mj(all/ 


135 290 


0.60 


0.68 


75 290 


0.53 


0.53 


I35**4I2 


0.63 


0.63 


412— 135 


0.57 


0.62 


412 — 75 


0.59 


0.58 


290** 412 


0.62 


0.62 


202** 412 


0.59 


0.60 


column average 


0.59 ± 0,03 


0.61 ±0.05 



" Center of mass of tr>'ptophan emission spectra of D-PGK and 
D-PGK-A. *From the 0-1 normalized data corresponding to all of 
the spectroscopic markers described in Figure 5 for D-PGK and 
D-PGK-A. 



could pass all of the required tests, which confirmed their 
close similarity with respect to WT PGK, were utilized. 

The experiments described below demonstrate that all of 
the characteristic photophysical signatures which indicate 
energy transfer are apparent for all six of the donor/acceptor 
pairs described in this study, hi all six energy-transfer pairs, 
one observes (1) a quenching of the donor fluorescence in 
the presence of acceptor in the native state which decreases 
dramatically upon xmfolding (Figure 3, upper inset), (2) an 
enhancement of the acceptor fluorescence in the presence 
of donor in the native state which decreases upon unfolding 
of the protein (Figure 3, top), (3) a dynamic quench of the 
fluorescence lifetime of the donor in the presence of acceptor 
in the native state (Figure 3, bottom) which is greatly 
diminished upon unfolding, and (4) appearance of charac- 
teristic energy-transfer kinetics in the acceptor time-resolved 
fluorescence which yields essentially identical distances as 
compared to the donor-emission analysis (Table 5). 

A long series of "control-type" experiments will be 
described first, not in order to estabhsh the existence of 
fluorescence energy transfer for each pair but rather to 
determine if there are any unique characteristics associated 
with any energy-transfer pair which would preclude its use 
in a real-time kinetic distance assay for protein folding. 

Photophysical Characterization of Each Donor-Labeling 
Site. AEDANS — * AF represents perhaps the most utilized 
fluorescence energy-transfer pair which has been character- 
ized to date (van der Meer et al., 1994). One of the major 
concerns with using AEDANS as a donor involves the fact 
that the fluorescence emission is very environment sensitive 
and multiexponential decay behavior on labeled proteins is 
often found [e.g., Chantler and Tao (1986)]. The origin of 
the heterogeneous character of the emission of proteins 
labeled with lAEDANS is unknown, hi this energy-transfer 




time (ns) 

Figure 3: Quantitative determination of the origin of the fluores- 
cence changes in the D-PGK-A sample using both steady-state (SS) 
and time-resolved (TR) fluorescence methodologies. (OMGuHCl, 
imlabeled spectrum) (SS) Corrected steady-state fluorescence 
emission spectra of D-PGK and D-PGK-A (135 290), folded (0 
M GuHCl) and unfolded (2 M GuHCI) proteins, at equal concentra- 
tions (2 fiU), in MOPS buffer at pH 7.5 and 25 °C, with an A„ of 
340 nm. (Inset) Donor emission increase in D-PGK-A when the 
protein unfolds. (TR) Time-resolved AEDANS fluorescence total 
intensity decays for the D-PGK (D) and tlie D-PGK-A (DA) for 
the 135 ^ 412 sample tmder native conditions (MOPS buffer at 
pH 7.5 and 0 M GuHCI at 25 °C). The differences between the 
measured data and the fits from a simultaneous analysis of the donor 
and donor/acceptor decays (Globals Unlimited, see Materials and 
Methods) reveal a random residual pattern. 

study, the lAEDANS was covalently attached to five different 
Cys residues of PGK: Cys75, Cysl35, Cys202, Cys290, and 
Cys412. Therefore, it was important to spectroscopically 
characterize each individual donor-only mutant (in both the 
native and unfolded states) in terms of (1) quantum yield, 
(2) emission spectral shape, (3) fluorescence lifetimes, and 
(4) rotational properties, before labeling the protein with an 
acceptor fluorophore. 

Table 3 summarizes the complete photophysical charac- 
terization of AEDANS (donor) labeled at each specific site 
in PGK. The fluorescence hfetime decay requires three 
exponentials for all labeled proteins. All sites had a long 
lifetime term that varied from 14 to 28 ns which accounts 



Table 3: Steady-State and Tune-Resolved Fluorescence Total-Intensity and Anisotropy Decay Parameters of All the Single-Cysteine D-PGK 
Samples as a Fuuction of the Labeled Positiou 

- 



donor-only 


quantum yield 


Aem max 




Ti (ns) 




T: (nsj 




T3 (ns) 


<r) 




</»B(ns) 


Pi 


<Ai;(ns) 


site 


(±0.03J 


(nmj 


ai 


(±0.5) 




(±0.8J 




(±0.2) 


(steady state) 


(±0.02) 


(±5) 


(±0.03) 


(±0.3) 


290 


0.54 


496 


0.53 


27.8 


0.26 


8.0 


0.21 


0.5 


0.076 


0.12 


30 


0.07 


0.6 


412 


0.40 


503 


0.63 


17.8 


0.10 


4.6 


0.27 


0.5 


0.081 


0.14 


23 


0.06 


0.9 


75 


0.40 


504 


0.60 


17,9 


0.15 


5.0 


0.25 


0.2 


0.070 


0.12 


21 


O.IO 


0.6 


135 


0.35 


513 


0.56 


15-2 


0.18 


5.5 


0.26 


0.6 


0.047 


0.08 


15 


0.12 


0.6 


202 


0-36 


514 


0.42 


14.2 


0.34 


6.5 


0.24 


0.3 


0.053 


0.07 


18 


0.13 


0.9 


all" 


0.35 


515 


0.5 


13.5 


0.18 


4.5 


0.32 


0.3 


0.02 


0.06 


4±2 


0.13 


0.6 



•* Unfolded state; and also represent local motion components. Total-intensity (cu and t;) and anisotropy and <^i) symbols are defined in 
eqs 12 and I, respectively. 
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Wavelength (nm) 

Figure 4: AEDANS fluorescence-corrected emission spectra of 
all six energy-transfer donor-only san^les (two-cysteine, single- 
donor labeled PGK mutants). Proceeding in the order of increasing 
wavelength of maximum emission: 75 290, 135 290, 412 ^ 
75, 135 412, 290 412, and 202 412. MOPS buffer at pH 
7.5, where T = 25 °C and Aex = 340 nm, at 2 fiU. The dotted line 
spectrum represents that observed for all of the unfolded proteins 
(2 M GuHCl). 

for 75-95% of the total fluorescence intensity, a medium 
lifetime of 5-8 ns, and a short lifetime of 200-600 ps (<1% 
of total). In the unfolded state, the observed lifetimes for 
all of the mutant-labeled proteins are identical (within 
experimental error; see Table 3). 

AEDANS bound to Cysl35 and Cys202 has properties in 
the native state which are nearly identical to what is observed 
for the unfolded protein (see Table 3). These positions have 
the lowest quantum yield (0.35, similar to the unfolded 
proteins), the more red-shifted emission spectra (Amax = 512— 
515 nm), shorter fluorescence lifetimes, and a less restricted 
motion (^f, eq 1) than the probes at other locations. Positions 
135 and 202 are absolutely ideal labeling sites, in that as 
the protein undergoes a folding transition the dye's rotational 
and photophysical properties are essentially invariant. 

AEDANS labeled at Cys290 has the highest quantum yield 
(0.54), the most blue-shifted emission spectra (Amax = 494 
nm), longest fluorescence lifetimes, and a more restricted 
rotational environment. Compared to Cysl35 and Cys202, 
Cys290 is in the most protective envirormiental position of 
any of our labeled proteins. The photophysical character- 
istics of Cys412 and Cys75 are between these two extreme 
classes, but much closer to the unfolded protein state than 
is Cys290. When the protein unfolds (in the presence of 
> 1 M GuHCl, Table 3 all), the different sites have identical 
spectroscopic properties. 

Although the single-cysteine mutants were very useful for 
characterization of the individual properties associated with 
each labeled site, these proteins are not actually utilized in 
the fluorescence energy-transfer efficiency calculations. For 
these calcvdations, the two-cysteine proteins are required. 
Figure 4 shows the corrected emission spectra of all of the 
donor-alone (two-Cys mutants), for the native and unfolded 
state. The emission peak changes from 499 nm for 75 
290 to 513 nm for 202 412. The unfolded proteins all 
have an emission maximum of 5 1 5 nm. These measured 
spectral shifts and quantum yield changes were taken into 
account in the calculation of Ro (see below and Data 
Analysis). One can see, however, that for these samples the 
actual spectral changes associated with each particular D— A 
site are relatively small, generating an average change in 
overlap integral of only «8%. 

The comparison of the sum of the two emission spectra 
of each donor single-Cys sample (weighted for the optical 
density OD at the excitation wavelength) with the OD 



weighted average emission spectra of the doubly labeled and 
the singly labeled two-Cys samples (our control in the FRET 
experiments) allows us to estimate the relative labeling 
efficiency at each site of the two-Cys mutants and the 
possible effect of the free cysteine on the fluorescence 
emission of the donor-only sample. Rather surprisingly, for 
135 290, the two-Cys mutant with the most distinctly 
different donor-only spectrum, the measured emission spectra 
of the donor-only tAvo-Cys samples could be exactly repre- 
sented as the linear combination (50% each) of the emission 
spectra of the two single-Cys donor-only samples (Cysl35 
and Cys290). These results reveal that, even under condi- 
tions where the cysteine sites are most different in the 
photophysical environment, the donor probe will be evenly 
distributed between both sites. It is also clear that the 
presence of the free Cys in the donor-alone sample has 
absolutely no effect on the overall fluorescence at the labeled 
site. 

Experiments were also undertaken in order to test whether 
the presence of two cysteines would affect the intrinsic 
stability of the protein. PGK's stability can be convenientiy 
monitored using intrinsic tryptophan fluorescence (Szpi- 
kowska et al., 1994; Sherman et al., 1995; Beechem et aL, 
1995). Similar to the unlabeled protein, when the labeled 
protein unfolds, a large increase in the tryptophan fluores- 
cence is observed and the typtophan emission spectra are 
shifted to the red. The advantage of using the tryptophan 
fluorescence of the labeled protein is that it represents an 
independent way to study the stability of the D-PGK and 
D-PGK-A samples. Tryptophan fluorescence emission spec- 
tra were found to be identical at 0.0 and 2.0 M GuHCl for 
both D-PGK and D-PGK-A samples (all mutants), indicating 
that tryptophan to AEDANS or AF energy transfer in this 
system must be very small (data not shown). Unfolding 
experiments were performed on all of the two-Cys mutants, 
with a special emphasis on examination of the singly labeled 
and doubly labeled proteins. 

In addition to tryptophan fluorescence, it was found that 
one can monitor the unfolding transition with each of the 
various fluorescence probes at each site using the emission 
wavelength shifts in AEDANS and AF (even though these 
shifts were very small, they were very reproducible and easily 
quantitated). Figure 5 summarizes the equilibrium unfolding 
transitions monitored from all the different fluorophores 
(AEDANS, AF, and Trp, average FRET efficiency). The 
equilibrium unfolding transitions for all six donor/acceptor 
proteins followed a simple two-state model (see Figure 5). 
In Table 2, we compare the Cm determined from the 
tryptophan emission of the D-PGK and D-PGK-A samples, 
with tiie Cm*" values from the simultaneous analysis of all 
the different fluorescence changes. Rather remarkably, for 
these derivatized proteins, the presence or absence of the 
fluorescence label had essentially no effect on the overall 
GuHCl stability (i.e., the D-PGK and D-PGK-A unfolding 
transitions are con^letely overlapping). In addition, for all 
the samples, the cooperativity indexes were also similar, and 
in the range of 10 ± 2. The overlap of all of the unfolding 
transitions indicates that the singly labeled donor-only sample 
is a good control for the doubly labeled donor/acceptor 
sample. The fact that the individual stabilities between 
different double-cysteine mutants are essentially coincident 
allows a direct comparison of changes in distance between 
different D/A pairs without having to take into account 
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Figure 5: Equilibrium unfolding transitions monitored by steady- 
state fluorescence using multiple spectroscopic monitors (see 
below). The fraction of unfolded protein at each GuHCl concentra- 
tion, calculated from the change in the center of mass of the 
emission spectra or by changes in energy-transfer efficiency, is 
plotted as a function of the denaturant concentration. Solid lines 
represent the simultaneous nonlinear least-squares fits of all the 
data to eq 3 using a single two-state transition: (O) tryptophan 
emission from D-PGK (Xtx = 295 nm), (□) tryptophan emission 
from D-PGK-A (A« = 295 nm), (V) AF emission from D-PGK-A 
(Aex ^ 460 nm), (a) AEDANS emission from D-PGK (Aex = 340 
nm), and (O) fluorescence steady-state energy-transfer efficiency 
from the ratio of D-PGK and D-PGK-A (Aex = 340 nm, donor- 
side emission). 

altered intrinsic stability. This was a very important result 
which could not have been predicted beforehand; singly and 
doubly labeled proteins have identical (within experimental 
error) stabilities. 

Characterization of the site-specific changes in acceptor 
(AF) absorption and emission (total intensity and anisotropy) 
was also performed (data not shown). It was found that all 
AF-labeled sites have nearly identical photophysical proper- 
ties. There is essentially no change in the fluorescence 
quantum yield and molar extinction coefficient for all the 
different sites in the folded proteins. The absorption peak 
is in the range of 492-494 nm. When the protein unfolds, 
the emission and absorption spectra are 2—4 nm red-shifted, 
and the quantum yield slightly decreases from 0.8 to 0.75. 

The final photophysical parameter required to completely 
characterize this system was the rotational characteristics of 
the fluorescence probes attached to each of the individual 
sites. Time-resolved and steady-state anisotropy experiments 
were performed. The steady-state anisotropics of AEDANS 
from each of the individual sites were 0.076 (Cys290), 0.081 
(Cys412X 0.070 (Cys75), 0.047 (Cysl35), and 0.053 (Cys202). 
In the unfolded state, all of the observed anisotropics dropped 
to 0.02. The time-resolved anisotropics for all of the mutants 
partitioned into approximately 50% fast local motion (600 
ps) and 50% global motion of the entire protein (20-30 ns) 
(Table 3). In the unfolded state, all of the probes revealed 
70% of a 600 ps motion and 30% of a slower component of 



Table 4: Estimated (k^)* Value, Maximum and Minimum 
Limits, and Forster Distances for All Six Energy-Transfer Pairs 



energy-transfer 
pair 


(±0.2) 


(±0.02) 


(±0.1) 


/2o(V3j 

(A) (±1) 


(A) (±2) 


135 — 290 


1.30 


0.19 


2.6 




45.6 


290^ 135 


1,26 


0.22 


2.5 


43.2 


50.2 


135 — 412 


0.59 


0.25 


23 


41.9 


40.0 


412—135 


0,68 


0.20 


2.6 


42.4* 


42.6 


290 — 412 


2.11 


0.20 


2.6 




54.8 


412-290 


2-47 


0.12 


3.1 


43.8 


52.8 


75 — 290 


1,25 


0.15 


2.9 




47.0 


290 — 75 


1.24 


0.14 


3.0 


44.0 


50.1 


202 — 412 


1.67 


0.26 


2.2 




47.6 


412 — 202 


2,16 


0.13 


3.0 


41.7 


51.6 


412 — 75 


1,06 


0.12 


3.1 


42.4 


45.8 



" Ro from (k^)*, and J and Qd, for each Cys, singly labeled PGK. 
* Specific labeled sample. All unfolded proteins were found to have a 
(tc^) well approximated by Vi, yielding an Ro(y^) of 41 A. 



«4 ns. From the measured time-resolved anisotropics of 
each of the probes at each of the individual sites, site-specific 
depolarization factors were obtained. From the measured 
limiting anisotropy for the fast local motion QSg), we 
calculated an approximate cone angle (over which the probes 
move) of 35'' for Cys290-, C:ys412-, and Cys75-labeled 
positions and between 40 and 50*^ for Cysl35, Cys202, and 
the unfolded proteins. Applying averaging analysis, the 
maximum and minimum {K:^)mzx and (fc^)nimvalues from the 
measured depolarization factors (dx/) and {(1^^) (Dale et al,, 
1979) were calculated. The calculated site-specific (k^) limit 
values are between 0.12 and 3,0 for all the different positions 
(Table 4). For experimental and theoretical reasons described 
below, this uncertainty in the (k^) value represents a large 
overestimation of the actual value. An in^roved orientation 
estimation procedure was also utilized, which makes use of 
the site-specific crystal structure, most probable fluorophore 
location, and measured time-resolved anisotropy mobilities 
[see (k^)* (Table 4)]. Although it is very difficult to estimate, 
the uncertainty in the (/c^)* determined in this manner 
(described in detail in Materials and Methods) is ap- 
proximately ±0.2. From the measured time-resolved anisotro- 
pics of the unfolded proteins (both donor- and acceptor- 
labeled), it is clear that (ic^) = V3 is a very good approximation 
to this state. 

To experimentally test the limits of the effect of the (k^) 
orientation factor, as well as the lifetime heterogeneity on 
the recovered energy-transfer distance determination, a 
detailed energy-transfer study was performed on multiple 
specifically labeled proteins which combined sites (rys412 
with Cysl35. This pair represents an extreme in changes in 
donor orientation factor, with site Cys412 and Cys 135 having 
donor depolarization factors ({do')) of 0.84 and 0.64, 
respectively (eqs 6 and 7). This case also represents a 
significant change in lifetimes, in that the specifically labeled 
site 412 has a donor lifetime that is almost purely single- 
exponential (97% of a 1 7 ns lifetime), con:^)ared to the mixed 
labeled sample which consists of approximately 84% of a 
17 ns component and 15% of a 5.1 ns lifetime term. The 
experimentally recovered intersite distances were found to 
be identical for these two extreme cases (Table 5; con^are 
135 412 with 412 135). Since these two independent 
distance measurements are performed on proteins represent- 
ing an extreme in site-specific rotational mobility and altered 
lifetime properties, we conclude that the rotational and 
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Table 5: Comparison of the Measured FRET Distances with Tliat Predicted from the Crystal Structure' 



energ>'-transfer 
pair 



measured 
steady-state 
distance (A) 



measured time-resolved 
discrete distance 

~7(A) ^ 



measured time-resolved 
distance distribution 



[±2] 



r(A) 



crystal 
structure 



estimated 
dye-to-dye 
distances (A)^ 



135** 290 


43 


43.3 


2.7 


39.4 


7.3 


1.3 


39 


39 






40.3^ 


1.6 


38.8^ 


6.1 


1,2 






135** 412 


40 


40.4 


2.7 


39.5 


3.8 


1,3 


40 


46 


412—135 


40 


39.5 


2.1 


38.0 


3.9 


1.2 










38.7 


1.4 


38.1*= 


3.4 


1.3 






290** 412 


69 


63.6 


1,4 


64.8 


13.5 


1,3 


48 


56 






56-6'= 


1,8 


58.6*= 


13,2 


1.4 






75 290 


50 


51.7 


4.3 


46.6 


13.5 


1.2 


40 


46 


202** 412 


39 


41.7 


1.5 


37.8 


6,6 


1.1 


26 


34 


412 — 75 


47 


48.2 


3.1 


44.8 


13.5 


1.4 


32 


46 


all^ 




60-70 


1.1 


60-80 


15-30 


1.1 







" Watson et al. (1982). * Donor-to-acceptor distance from MD simulations based on Watson ei al. (1982) crystal structure. *= Acceptor-side FRET 
measurements, Unfolded samples (MOPS buffer at pH 7.5 and 25 °C and 2 M GuHCl). ' MOPS buffer at pH 7.5 and 25 X. D ** A: average 
labeled samples (donor distributed bet>\'een the two Cys sites). D — A: specific labeled san^les. Unless otherwise indicated, distance determinations 
are from donor-side experiments. The errors on the recovered distances are dominated by **nonfitting** sources and are estimated to be ±3 A (see 
the text). 



lifetime effects on the -rest of the energy-transfer pairs 
described in this work are probably relatively smaU. 

Steady-State Energy Transfer, The doubly labeled PGK 
sample shows a major emission band due to AF fluorescence 
and a small contribution of AEDANS fluorescence (Figure 
3, top). Donor emission is quenched through FRET from 
AEDANS to AF when the protein is folded The fluores- 
cence changes are measured by comparing the fluorescence 
emission spectra of the D-PGK and the D-PGK-A labeled 
proteins for the native samples at the same concentration. 
The steady-state average FRET efficiencies {£) for the native 
and the unfolded state are obtained from the ratio of the 
integrated fluorescence emission spectra of the D-PGK-A 
and D-PGK samples on the donor side (see Data Analysis, 
Aex = 340 nm, Aem — 400—460 nm). Since the donor-side 
spectrum can be obtained over a region with absolutely no 
overlap with the acceptor emission, calculation of the transfer 
efficiencies is especially easy, and not prone to artifacts 
associated with spectral subtraction of the acceptor-only 
emission. The steady-state calculated average distances are 
presented in Table 5. For all the folded and unfolded 
samples, the scaled donor emission spectrum is. subtracted 
from the donor/acceptor spectrum. The shape of the differ- 
ence spectrum is coincident with the scaled acceptor emission 
spectrum that is obtained when the acceptor is excited 
directly (A« > 440 nm). 

AEDANS and AF fluorescence emission spectra have 
significant overlap. In the acceptor-side measurements (Aem 
> 480 nm), FRET efficiencies were calculated from the AF 
emission enhancement (data not shown). However, due to 
the large number of extra manipulations of the data required 
to monitor the acceptor enhancement, the precision of the 
acceptor-determined measurements was much inferior to the 
donor-quench determinations. There was, however, very 
good agreement between the two methods, with differences 
in calculated energy -transfer efficiencies of <5%. The 
biggest source of error in these determinations comes from 
uncertainties in absolute protein concentrations and from 
subtraction of the directly excited acceptor emission (Xex = 
460 nm). 

For the unfolded samples, the differences between the 
D-PGK and the D-PGK-A spectra are within the experi- 



mental error. Time-resolved experiments (described below), 
however, can be performed much more accurately and allow 
the estimation of the intramolecular distances in the unfolded 
state. 

Time-Resolved Energy Transfer. Time-resolved fluores- 
cence Hfetime measurements were performed on all six FRET 
pairs (donor side, Aex = 325 nm, X^n = 450 nm). Three 
fluorescence lifetimes were observed for all the D-PGK 
decays. The fluorescence decays of D-PGK-A samples were 
also satisfactorily analyzed with an empirical three-expo- 
nential model (xg^ = \A-\2). Analysis of the time-resolved 
fluorescence data was performed at three different levels of 
. complexity: (1) average energy-transfer efficiency calculated 
from the mean lifetime of donor and donor/acceptor proteins, 
(2) global analysis of donor and donor/acceptor decay in 
terms of discrete distance(s), and (3) global analysis of donor 
and donor/acceptor decay in terms of a distribution of 
distances. 

The first level of analysis of the time-resolved data was 
analysis of the D-PGK and D-PGK-A fluorescence decays 
in terms of empirical exponential functions. From these 
recovered lifetimes (see Materials and Methods), one can 
calculate energy-transfer efficiencies using ^ = 1 — ({tda)/ 
(td)). The main purpose of this analysis was simply to 
determine if the steady-state transfer efficiencies and time- 
resolved data were internally consistent. Since the time- 
resolved data did not require knowledge of the absolute 
concentrations of the donor-only and donor/acceptor samples, 
uncertainties in protein concentrations would not contribute 
to a measured efficiency. Similarly, the steady-state transfer 
efficiencies can be affected by static quenching mechanisms 
which will not affect the time-resolved signals. When the 
data analysis is performed in this very simple manner, one 
fmds that the recovered distances determined by both 
methods are in complete agreement (see Table 5). These 
results svq)port the accuracy of both the protein concentration 
determination of each of the mutants and the fact that the 
majority (if not all) of the donor quenching which is observed 
in the steady-state measurement is dynamic in nature and 
consistent with an energy-transfer process. It should be noted 
that the small amplitude (<1% of total fluorescence) short 
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lifetime component was invariant between donor and donor/ 
acceptor samples and hence does not appear to be undergoing 
any fluorescence energy transfer. Therefore, in all of our 
subsequent analyses, no energy-transfer distance was associ- 
ated with this short lifetime term and its value was linked 
between both the donor-only and donor/acceptor samples. 

Global analyses were then performed by simvdtaneously 
analyzing the decay curves of the D-PGK and D-PGK-A 
samples in terms of an internally consistent distance(s). Both 
discrete and distributed distances (parameterized as Gauss- 
ians) were utilized. Global fits in terms of a single discrete 
distance were not satisfactory for any of the native proteins. 
Severe distortions in the weighted residual distribution were 
apparent, and the reduced Xg values for most of these 
analyses were >2.0. Global analysis of the time-resolved 
energy -transfer data in terms of a single distributed distance 
yielded the best Xg^ and random residual pattems with the 
least number of adjustable fitting parameters. Protein 
conformational heterogeneity, orientational factor, and the 
length and flexibility of the hnker of the donor and acceptor 
probes may be contributing to this distribution. The global 
fitting in terms of a Gaussian distance distribution gave a 
satisfactory fit to the data for all six donor/acceptor pairs. 
Random residual pattems (such as those shown in Figure 3, 
bottom) and reduced global Xg^ values in the range of 1.1 - 
1 .3 were obtained for all cases. The mean distance from 
the single-distance distribution analysis, Re, was in good 
agreement with the corresponding discrete single-distance 
analysis. 

Three of the six energy-transfer pairs were also examined 
by global analysis of data obtained at the acceptor- side 
emission, with excitation at 325 nm and emission at 560 nm 
(Table 5). Nearly identical distance distributions are obtained 
fi-om the acceptor-side fluorescence emission, further strength- 
ening the fact that the observed quenching in this system is 
dominated by energy transfer and not some other mechanism. 
For the unfolded proteins (2 M GuHCl), only small energy- 
transfer efficiencies are observed. Since the transfer ef- 
ficiencies are rather small, it is not possible to perform the 
sophisticated type of distribution analysis which is performed 
for the native proteins. The global analyses were consistent 
with either a single discrete distance model or a broad 
distribution distance model. All of the energy-transfer 
signals are reversible, as refolded proteins were found to have 
distance distributions nearly identical to those of proteins 
which were never previously unfolded. 

Figure 6 shows the recovered donor— acceptor distance 
distributions of all six doubly labeled PGK derivatives. The 
mean of the recovered donor— acceptor distance distribution 
matches that predicted from the crystal structure almost 
exactly for 75 290, 412 — 75, 202 ^ 412, and 135 
290. These four sites span quite different regions of the 
tertiary structure of PGK (for donor/acceptor pair locations, 
refer to Figure 1). For 290 41 2, the recovered mean from 
the donor-side decay is 65 A, while the predicted dye-to- 
dye distance is 56 A. The acceptor-side measurements 
yielded a distance of 59 A, much closer to the predictied 
distance for this case. The recovered mean distance from 
135 412 and 412 135 is 40 A (compared to a predicted 
separation distance of 46 A) when measured from either the 
donor quench or acceptor enhancement (three independent 
measurements, two different samples). Given the identical 
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Figure 6: Recovered energy- transfer distance distributions from 
the global analysis of the time-resolved fluorescence decays of 
D-PGK and D-PGK-A. The orientation factor (k^)* was utilized 
for each analysis. The predicted dye-to-dye distances (♦) from the 
molecular dynamics simulation of each dye pair built into the PGK 
crystal structure are shown for comparison. 
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Figure 7: Linear correlation of the measured energy- transfer 
distances (mean of the recovered distribution function) as a function 
of the predicted dye-to-dye distance from the molecular dynamics 
simulation of each dye pair built into the PGK crystal structure. 
The regression line and data for both the (k^) = and the enhanced 
orientation factor estimation {k^* are shown. A "perfect" distance 
assay would be a line extending exactly along the diagonal. 

energy-transfer distances which are obtained at this position, 
it is entirely possible that the majority of the "error" 
associated with this site has to do with the calcdation of 
the projected dye-to-dye separation distance. 

A plot of the recovered energy-transfer distances as a 
function of predicted dye-to-dye distances (xising the PGK 
crystal structure) is shown in Figure 7. Independent of the 
(1^) or type of data (time-resolved or steady-state fluores- 
cence), linear correlations are observed for the recovered 
distances versus predicted distances from the crystal structure. 
The correlation coefficients for these lines are 0.97 and 0.96 
for the time-resolved (k^) (=^3) and the {k^}* respectively. 
The slope of the linear regression line of the time-resolved 
energy-transfer distance estimations with (/c^)* is 1.18, and 
with {k^} (=^3) 0.85. This result compares very favorably 
with the ideal-case distance assay which would have a slope 
of 1 .0, indicating that, for all of the donor/acceptor probes 
(when examined as a total system), the measured distances 
reflect the predicted structural distances to within <20%. 
For these regression curves, the 135 412 energy-transfer 
pair was omitted due to probable errors in the calculation of 
the predicted dye-to-dye distance (see above). However, 
even when this additional FRET pair is included, the 
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correlation coefTicient is still very good (r = 0.89), and the 
slope is 1,16. 

CONCLUSIONS 

The determination of long-range distance changes in 
proteins (10-100 A) is an area that only fluorescence energy- 
transfer measurements (FRET) are currentiy capable of 
making. This work represents the first case study in which 
six independent donor/acceptor pairs have been engineered 
into a single protein for the expressed purpose of making 
distance measurements during a protein folding reaction. 
Essential for this study were the determinations that the 
fluorescence probes (themselves) did not alter the folding 
reaction and that the FRET assay was dominated by changes 
in distance and not by some other spectroscopic effect. 

A systematic investigation of (up to) nine singly and 
doubly labeled proteins at each of the six donor/acceptor sites 
was able to conclusively show that the stabilities of each of 
the labeled proteins were identical (Figure 5 and Table 2). 
This was not exactly an expected result, in that it would be 
very easy to imagine that the inherent stability of the doubly 
labeled PGK would differ from that of the singly labeled 
proteiQ. There are, however, cysteine sites in PGK where 
this strategy will not work (e.g., the native Cys97; data not 
shown) where flie stability of the protein labeled at this site 
is dramatically altered. In these studies, we have avoided 
all sites with altered stability characteristics. We have found 
that these "extra-sensitive sites" are in a minority and that, 
by using the crystal structure as a guide, cysteine sites can 
often be engineered into a protein and labeled with fluoro- 
phores without producing a significant perturbation in 
stability. 

A variety of spectroscopic effects could (in theory) 
dominate the energy -transfer efficiencies which are measured 
in a native (or partially folded) structure. Therefore it was 
absolutely imperative to show that the recovered energy- 
transfer distances correlated with distances predicted from 
the crystal structure of PGK, Figures 6 and 7 reveal that, 
over the measured distance range of 30-70 A, the multisite 
distance assay which has been engineered into PGK ac- 
curately measures the distances predicted from the crystal 
structure to within 20%. Examination of Figure 1 reveals 
that this distance assay system will be capable of quantitating 
long-range distance changes between the amino and car- 
boxy 1-terminal domains, within the amino-teiminal domain, 
as well as ^'unraveling" of the C-terminal end from the 
amino-terminal domain. The use of this multisite FRET- 
based distance assay to monitor the kinetics of xmfolding of 
PGK is described in the following paper. 
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ABSTRACT Fluorescence resonance energy transfer and 
fluorescence polarization anisotropy are used to investigate 
single molecules of the enzyme staphylococcal nuclease. In- 
tramolecular fluorescence resonance energy transfer and 
fluorescence polarization anisotropy measurements of fluo- 
rescently labeled staphylococcal nuclease molecules reveal 
distinct patterns of fluctuations that may be attributed to 
protein conformational dynamics on the millisecond time 
scale. Intermolecular fluorescence resonance energy transfer 
measurements provide information about the dynamic inter- 
actions of staphylococcal nuclease with single substrate mol- 
ecules. The experimental methods demonstrated here should 
prove generally useful in studies of protein folding and enzyme 
catalysis at single-molecule resolution. 



Single-molecule spectroscopy can provide information about 
complex biological molecules and systems that is difficult to 
obtain from ensemble measurements (1-6). For example, one 
can observe the time trajectories of single molecules in bio- 
chemical reactions that cannot be synchronized in ensemble 
experiments. In a population of molecules heterogeneous in a 
particular property, single-molecule spectroscopy can also 
resolve and quantitatively compare distinct subpopulations 
that woxild be indistinguishable at the ensemble level. 

Fluorescence resonance energy transfer (FRET) measure- 
ments between single pairs of acceptor and donor fluoro- 
phores can yield information about structural relationships 
and distance fluctuations between regions of a single biomol- 
ecule or between components of an interacting system of 
biomolecules (7-10). In addition, the rotational dynamics of a 
single f luorophore can be probed by monitoring fluctuations 
in fluorescence polarization (11-14). Here we develop the 
techniques of single-pair FRET (spFRET) and single - 
molecule fluorescence polarization anisotropy (smFPA) and 
show how they can be used to observe the conformational 
fluctuations and catalytic reactions of enzymes at single- 
molecule resolution. 

Staphylococcal nuclease (SNase) is a 19 kDa Ca^"^- 
dependent enzyme that catalyzes the hydrolysis of DNA and 
RNA into mono- and dinucleotides (15). Its catalytic mecha- 
nism, thermodynamic stability, and folding pathway have been 
studied extensively at the ensemble level (16-21). To probe the 
conformational dynamics of SNase and its interactions with 
substrate at single-molecule resolution, three experimental 
methods were used. First, intramolecular spFRET was mea- 
sured between donor and acceptor fluorophores attached to 
single SNase proteins. Second, single-molecule fluorescence 
polarization anisotropy measurements were performed by 
using SNase labeled singly with one type of f luorophore. Third, 
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m/ermolecular spFRET was measured between donor-labeled 
SNase and acceptor-labeled DNA substrate. 

Using intramolecular spFRET measurements on single 
SNase protein molecules, we observe interesting dynamics 
including gradual fluctuations in the FRET efficiencies. A 
combination of smFPA measurements, simulations, and spec- 
tral fluctuation measurements shows that distance fluctuations 
must contribute to the observed FRET efficiency fluctuations. 
We also use smFPA and spFRET to study protein-inhibitor 
binding and show that these methodologies are sensitive 
enough to distinguish between the free and inhibitor-bound 
states of the enzyme. In the intermolecular spFRET part of 
this work, we study the interactions between single SNase 
proteins and single-stranded DNA (ssDNA) substrates and 
develop methods that are useful for studying complex aspects 
of enzyme catalysis such as processivity and the statistics of 
enzymatic events. In the course of this work, we have also 
developed data analysis methods such as autocorrelation 
analysis of spFRET and smFPA time-trace data that will be 
generally useful in single-molecule experiments. 

MATERIALS AND METHODS 

Protein Conjugation. Tetramethylrhodamine (TMR, do- 
nor) and CyS (acceptor) were selected as the FRET pair for 
their well-separated emission wavelengths (>100 nm) and 
large Forster radius [assmning isotropic and rapid rotation for 
both TMR and CyS (k^ = 2/3) and applying the available 
spectral data, the calculated 50% energy transfer distance, or 
is 53 A, according to the equation 

Ro^ = 8.79 • 1023(rt - • (pD •/ • K)2A^ 

where n is the index of refraction of the mediiun, <pj) is the 
donor quantum yield, is the orientational factor, and /is the 
spectral overlap of donor emission and acceptor absorption in 
M'^'cm^]. TMR maleimide (Molecular Probes) was conju- 
gated to the Cys^^ of the K28C mutant of SNase in 50 mM 
glycine, pH 7.0/2% dimethyl sulfoxide over 2 h at 25°e. The 
conjugate was purified by size-exclusion chromatography with 
Sephadex G-25 resin. Dye attachment was verified by dena- 
turing polyacrylamide gel electrophoresis. Cy5 succinimidyl 
ester (Amersham Pharmacia) was conjugated to TMR-SNase 
under the same labeling conditions and purified and charac- 
terized as described above. High-resolution mass spectrometry 
showed that about 15-20% of protein molecules were conju- 
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gated to only one Cy5 f luorophore with a smaller percentage 
of double labeling. The Km and Kmax values of TMR- and 
TMR/Cy5-dual labeled SNase were indistinguishable from 
those of the unlabeled protein. 

DNA Conjugation. The sequence of 5'-end-Cy5-labeled 
oligonucleotide was 5 'XACGTCACGCTAGTCAGTCATCT- 
TGCACATGTCCTTGAGA, where X represents Amino 
Modifier dT (Glen Research, Sterling, VA). The thymidine 
analog was labeled with Cy5 succinimidyl ester in 0.1 M sodium 
carbonate buffer, pH 9.0, over 2-12 h at 25"C. The conjugate 
was first passed through a NAPS column (Pharmacia) with 
water, then purified by RP-HPLC using a gradient of 5 -> 30% 
acetonitrile in 50 mM triethylammonium acetate. A FPLC 
Superose 12 column (eluent of 20 mM NaH2PO4/150 mM 
NaCl, pH 6.3) was used to remove residual free dye. 3 '-end- 
labeled oligonucleotide was prepared in an identical manner 
using 3' Amino Modifier C7 CPG (Glen Research). 

Protein Immobilization. Glass coverslips were derivatized 
with N-[(3-trimethoxysilyl)propyl] ethylenediamine triacetic 
acid (United Chemical Technologies, Bristol, PA). Specific 
immobilization of proteins by means of hexahistidine tag was 
established as follows: (/) more than 90% of immobilized 
proteins could be removed by flowing over the coverslip 100 
mM EDTA, pH 7.9 buffer, and («) without the prior appli- 
cation of Ni^"^ to the derivatized glass, a 10-fold reduction in 
SNase binding was observed. 

Experimental Setup. A laser scanning confocal microscope 
with a focused laser spot (0.4 fim) was used to image surface- 
immobilized protein molecules and measure their individual 
emission time trajectories (9, 12, 13). For spFRET measure- 
ments, donor and acceptor emissions were separated by a 
630- nm dichroic mirror and collected through a 585-nm band 
pass filter and 650-nm long pass filter, respectively. smFPA 
measurements of fluorophore rotational dynamics were per- 
formed by splitting the emission from a single fluorophore 
with a polarizing beam-splitting cube into two orthogonally 
polarized components, /s and /p, and simultaneously recording 
them on separate detectors (12, 13). To measure the dipole 
effects in absorption, the excitation beam was alternated 
between two orthogonal polarizations by using an electro-optic 
modulator. Argon ion laser light (15 /iW) at 514 nm was used 
for studying the energy transfer, as well as the rotational 
dynamics and spectral fluctuations of the donor, whereas 15 
of HeNe laser light at 632 nm was used to study the 
rotational dynamics and spectral fluctuations of the acceptor. 
The spectral fluctuations of donor and acceptor f luorophores 
conjugated to SNase were measured by splitting the fluoro- 
phore emission spectrum with a dichroic mirror with a cutoff 
at the peak emission and simultaneously recording the two 
emission halves on separate detectors. 

spFRET Analysis. The instantaneous FRET efficiency £fr) 
was calculated from the donor and acceptor emission inten- 
sities, /d and /a, according to the equation £" = [1 + y/d//a]~S 
where y is a correcrion factor determined to be 0.8 in the 
following manner. Another expression for E is [1 + 73</>a/ 
^<t>d]~^ where and are the true donor intensity and the 
sensitized emission intensity of the acceptor in the presence of 
energy transfer. The measured intensities are reduced by 
factors of rja and Tjd because of overall instrument detection 
efficiencies. It can be shown that the correction factor y 
(=Tja<^>o/Tjd<f>d) for the expression £ = [1 + Y-^dZ-^o]"* is equiv- 
alent to A/a/A/d , where A/a and A/d are the acceptor and 
donor intensity changes, respectively, on acceptor photo- 
bleaching. A/a /A/d was determined from 45 acceptor photo - 
bleaching events, and its distribution was centered at 0.8. The 
E(t) fluctuation time traces were processed as follows. A 
median filter (five points) was used to remove spikes in the 
data caused by triplet state-induced blinking. Dark states were 
discounted by skipping points with /d + /a < 10 counts oiE < 
0.3 (gradual fluctuations that lead to E < 0.3 were not 



observed, so this screening step does not remove them). The 
time average of E(t), E, and its autocorrelation function were 
calculated from the remaining data points. The autocorrela- 
tion function was fit with a^exp(-tlTE), where and Tfe are the 
E(t) fluctuation amplitude and time scale. 

smFPA Analysis. Rotational fluctuations were quantified by 
calculating the effective emission dipole angle 0emW- That is, 
although smFPA measurements do not directly report the 
in-plane dipole orientation when the fluorophore is rotating, 
it is possible to define an effective average dipole angle 
parameter Bcm(t) in the following way: 

0em(O = tan-*[(/,p(/i) + /,,(«))/(/p,(/i + 1) + /pp(n + \))f-' 

[1] 

where /jj are a set of four consecutive data points taken at times 
n and n + \, The first index denotes the excitation polarization 
and the second denotes the emission polarization. The angle 
parameter ©emW can be thought of as the average in-plane 
cone angle center of the rotating emission dipole measured 
with circularly polarized light at the mid-time point between n 
and + 1. The time average of BtmW autocorrelation 
function were calculated. The autocorrelation function was fit 
with aR2exp(-t/TR), where ar and tr are the Qcm(t) fluctuation 
amplitude and time scale. 

RESULTS AND DISCUSSION 

A laser scanning confocal microscope with a focused laser spot 
(0.4 /i,m) was used to image surface-immobilized protein 
molecules and measure their individual emission time trajec- 
tories (9, 12, 13). SNase proteins were synthesized for imaging 
with C-terminal hexahistidine tags and a Lys^** to Cys mutation 
(22, 23), labeled at Cys^** with the energy donor TMR and 
nonspecifically labeled with the energy acceptor Cy5. The 
labeled proteins were immobilized on Ni^^-diaminetriacetic 
acid-derivatized glass plates at a density of less than 1 protein 
per 2 area in 40 mM glycine, 10 mM CaCh, pH 9.5 (buffer 
A). In some cases, an oxygen-scavenging system [50 ^g/ml 
glucose oxidase, 10 /Ltg/ml catalase, 18% (wt/wt) glucose, \% 
(wt/vol)] p-mercaptoethanol) was added to prolong fluoro- 
phore lifetimes (1). 

Protein-conjugated donor f luorophores were selectively ex- 
cited with circularly polarized 514 nm Ar"^ light. The sample 
stage was raster-scanned as the donor and acceptor emissions 
were simultaneously recorded on separate detectors. The two 
resulting images were combined to form a composite image 
with donor and acceptor signals false colored in green and red, 
respectively. Each spot in the composite of Fig. \a represents 
a single protein. During the course of the scan, most molecules 
initially show large acceptor emission with no or very small 
donor emission. Then abrupt photobleaching of the acceptor 
occurs, concurrent with an increase in donor emission. This 
anticorrelated switch in emission intensities is evidence that 
the single donor fluorophore was transferring energy to the 
acceptor f luorophore(s) before photobleaching of the accep- 
tor (9). 

When an individual SNase molecule is positioned under the 
laser excitation spot, it is possible to record the donor and 
acceptor emissions as a function of time with 5-msec resolu- 
tion; a representative time trace is shown in Fig. \b. The 
inverse relationship between donor and acceptor emission 
intensities at 80 msec is again indicative of spFRET. In the 
subsequent analysis of spFRET, we restrict our attention to 
those that show the simultaneous acceptor photobleaching and 
donor signal recovery event to screen out SNase molecules 
with no acceptor or multiple acceptors. In principle, this does 
not completely exclude the possibility of multiple acceptor 
labeling; the photobleaching of one of multiple acceptors that 
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Time (msec) 

Fig. 1. (a) Dual-color composite image of doubly labeled SNase 
molecules immobilized by means of histidine tag in buffer A at 23°C. 
The 514-nm excitation laser spot (Ar**" light, 15 ;i,W) was scanned from 
top to bottom and left to right. Each spot represents a single protein. 
Donor emission is colored green and acceptor emission is colored red. 
(b) Emission time trace of a single double-labeled SNase molecule. 
The resolution of the measurement is 5 msec. The instantaneous donor 
and acceptor emission intensities, /a and I a (squares and circles, 
respectively), are related to the energy transfer efficiency by the 
expression Eft) = [1 -I- yla/h]'^, where 7 is a correction factor 
determined to be 0.8. The SNase molecule shown here displays a very 
high degree of energy transfer (high acceptor intensity, low donor 
intensity) from 0 to 80 msec. At 80 msec, photodestruction of the 
acceptor occurs and donor emission simultaneously increases. TTie 
inverse correlation is direct evidence for spFRET. 

are in close proximity to the donor will not show up as a 
detectable event since the energy pumped through the donor 
will be redistributed among the remaining acceptors. 

In addition to abrupt photophysical events such as photo- 
bleaching and blinking, the emission time traces of many 
doubly labeled SNase molecules were found to display gradual 
fluctuations in the donor and acceptor emissions, ranging in 
time scale from 10 msec to 1 sec (Figs. 2 a-c). These temporal 
fluctuations may reflect conformational changes in the protein 
side chains or backbone that give rise to changes in energy 
transfer efficiency, £(/), according to the relationship E{t) - 
[1 + {Rit)/RQ)'*]~\ where R{t) is the time-dependent distance 
between donor and acceptor fluorophores and Ro is the 
Forster radius. However, fluctuations in E(t) can also arise 
from other factors, namely fluorophore reorientation and 
spectral shift. To quantify the contribution of these latter 



factors to E(t) fluctuations, the emission characteristics of 
SNase molecides labeled singly with TMR or Cy5 were mea- 
sured. 

smFPA measurements of fluorophore rotational dynamics 
were performed by splitting the emission from a single flu- 
orophore with a polarizing beam-splitting cube into two or- 
thogonally polarized components, /« and /p, and simulta- 
neously recording them on separate detectors (13). To mea- 
sure the dipole effects in absorption, the excitation beam was 
alternated between two orthogonal polarizations. A typical 
smFPA time trace for a single TMR fluorophore attached to 
Cys^** of SNase is shown in Fig. 3a, As previously demonstrated 
for fluorophores tethered to DNA, immobile fluorophores 
generate correlated /« and /p signals, whereas rapidly rotating 
fluorophores generate anticorrelated signals (12, 13). Data 
collected and analyzed for over 100 TMR-labeled SNase 
molecules indicate that Cys^^-linked TMR rotates rapidly, with 
little restriction, on a time scale that is much faster than the 
integration time (5 ms) and slower than or comparable to the 
radlational lifetime of the fluorophore (Fig. 3 b and c). The 
rotation of Cy5, on the other hand, displays significant fluc- 
tuations on the 10 msec-1 sec time scale (Fig. 3d). Calctilations 
and simulations described below demonstrate that these are 
clearly insufficient to accotmt for the large fluctuations in 
FRET efficiency observed (Fig. 2c). The combination of free 
TMR rotation and hindered Cy5 rotation restricts the orien- 
tational factor in/?„ to the range 1/3 to 4/3 (the worst-case 
scenario involves static dipole orientation for Cy5 that changes 
direction on the millisecond time scale). Assuming no varia- 
tions in distance between donor and acceptor, the reduced 
uncertainty in predicts fluctuations in E(t) that are far 
smaller than those observed, according to the equation AE = 
E{1 — E)^k^/h}. This conclusion holds even for the case of 
nonspecifically Cy5-labeled protein, where some or all of the 
Cy5 molecules might be in the worst-case scenario discussed 
above. In addition, the distribution of Cy5 rotational fluctu- 
ation time scales differs substantially from that of E(t) fluc- 
tuation time scales (Figs. 2c and 3d). 

Observations of spectral shifts by using SNase labeled singly 
with either TMR or Cy5 (estimated by using known dye and 
optical filter spectra) were infrequent; they were less than 5 nm 
in amplitude with time scales >300 msec, indicating that 
spectral shifts make negligible contributions to E{t) fluctua- 
tions. Thus, the above experiments show that, even in the case 
of nonspecific Cy5 labeling, orientational effects and spectral 
shifts cannot account for the magnitude and rime scale of the 
observed fluctuations in FRET efficiency, i.e., these fluctua- 
tions must involve distance fluctuations and likely reflect 
conformational dynamics of the protein itself, possibly ampli- 
fied by interactions with the dyes and their tethers. 

spFRET and smFPA can also be used to probe the effects 
of ligand binding on protein dynamics. For example, TMR 
rotation, while rapid and unrestricted when conjugated to 
ligand-free SNase, displays hindered rotational dynamics and 
temporal fluctuations in the presence of the SNase active-site 
inhibitor deoxythymidine diphosphate (pTp) (Fig. 3 b and c) 
(16-21). Because Cys^^ is not in the immediate vicinity of the 
active site, these changes in smFPA likely reflect changes in 
protein dynamics that result from inhibitor binding, such as 
decreases in backbone or side-chain flexibility. £(/j fluctuation 
time scales are also affected by the inhibitor binding; addition 
of the inhibitor increases the time scales of the FRET effi- 
ciency fluctuations (Fig. 2d). Average values of Tfe and tr were 
determined by fitting the histograms (bin size 30 msec) to a 
single exponent. Simulations indicate that the probability of 
statistical aberration alone giving rise to the differences in time 
constants observed (220 msec vs. 41 msec for Figs. 3d and 2c, 
and 133 msec vs. 41 msec for Figs. 2d and 2c) is less than 0.1%, 
given the niunber of proteins studied. The distributions in Tfe 
and tr must also include the effects of nonspecificity of CyS 
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Fig. 2. (a) Emission time trace of a doubly labeled TMR/Cy5 SNase molecule immobilized on glass in buffer A with an oxygen scavenging system 
(donor emission is dotted, acceptor emission is solid). A median filter was applied to reduce the noise caused by triplet state-induced f luorophore 
blinking. There are large and gradual fluctuations in /a and/o that occur over tens of milliseconds, (b) FRET efficiency time trace calculated from 
a according to E(t) = [1 + yld/h]'^- The inset shows the autocorrelation ofE(t) together with an exponential fit. rcis the characteristic time scale 
of the E(t) fluctuations displayed in this trace, (c) Histogram of E(t) fluctuation time constants tb for 100 doubly labeled SNase molecules. The 
values range from 10 msec to 1 sec, with the average being 41 msec, (d) Histogram ofE(t) fluctuation time constants tc for doubly labeled SNase 
in the presence of active-site inhibitor pTp (50 mM pTp, = 100 nM). These time constants are considerably larger (average = 133 msec) than 
those measured for free SNase. (c) Scatter plot olE(t) fluctuation amplitudes an vs. mean energy transfer efficiencies E for free SNase (squares) 
and pTp-bound SNase (circles). The scatter in E may reflect the nonspecific nature of Cy5 labeling. The solid line represents the maximum possible 
contribution of dipole fluctuations of Cy5 to ac for free SNase. Only the molecules that display large E(t) fluctuations (>70% of the total) are 
shown. The others are concentrated around the bottom right comer (not shown) and are likely caused by Cy5 labeling at a site close to Cys^. 



labeling. However, the data do not show any dominant peaks 
in the fluctuation time-scale distribution; hence it is unlikely 
that biased sampling of certain Cy5 labeling sites within the 100 
molecules studied is responsible for the differences among the 
cases compared. These changes in spFRET could reflect 
changes in protein dynamics that result from inhibitor binding, 
but orientational effects of nonspecifically bound Cy5 labels 
could make significant contributions. 

Intramolecular FRET is a powerful technique that has the 
potential to advance our understanding of the conformational 
states and dynamics of biological macromolecules in equilib- 
rium, on ligand binding, during folding and denaturadon, and 
during catalysis, in ways that ensemble methods cannot. We 
note that intramolecular spFRET can be a more powerful 
technique than smFPA because the former probes the internal 
conformational states in the center- of-mass frame of the 
system and hence is less prone to complications due to the 
overall motion of the biological molecule. 

Inter molecular spFRET can be used to investigate interac- 
tions between a pair of biomolecules, such as TMR-labeled 
SNase and Cy5-labeled DNA substrate (10). Intermolecular 
spFRET measurements were made using the SNase mutant 
D40G (/ccat = 7s" ^ Km = 55 ^M), whose turnover rate is better 



matched to the 5-msec resolution of our current methodology 
than wild-type SNase (^cat = 100 s-\ = 55 ^M) (16-21). 

TMR-labeled protein was immobilized on derivatized glass 
plates by means of histidine tag in buffer A while a constant 
flow (0.1 ml/min) of a 10-nM solution of Cy5-labeled 40-nt 
ssDNA was flowed over the protein. Under these conditions, 
Cy5-labeled DNA adhered to the derivatized glass at a density 
of approximately 10 molecules per 1 fjtm^ and remained fairly 
constant throughout the experiment (the Cy5-DNA molecules 
were visualized by using direct 632 nm HeNe laser excitation 
of the Cy5 f luorophores). Therefore, the probability of FKET 
arising from random proximity between SNase and DNA is at 
most 1/1,000. 

Emission time traces were collected by finding an enzyme 
using the donor emission and waiting for the arrival of an 
acceptor-labeled DNA molecule. Anticorrelated donor and 
acceptor emissions are again indicative of spFRET (Fig. 4a). 
For each measured FRET event, the duration of the FRET 
signal (rime for which there is continuous acceptor emission), 
or Tassoc, was extracted. Tossoc represents the duration of one of 
several possible FRET-generating events: single- instance 
cleavage of DNA substrate by SNase, multiple and successive 
cleavages of DNA by SNase, binding and imbinding of DNA 
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Fig. 3. {a) A typical smFPA time trace of Cys^* TMR-labeled 
SNase immobilized by means of histidine tag in buffer A. The two 
orthogonally polarized emissions, . /« (dotted) and Ip (solid), are 
displayed as a fiinction of time. Because correlated emissions corre- 
spond to a fixed fluorophore dipole and anticorrelated emissions 
correspond to a rapidly rotating fluorophore dipole, the SNase- 
conjugated TMR molecule shown here is rotating rapidly (much faster 
than the data integration time of 5 msec), (b) The angle parameter 
6cm(t) calculated from a. The value of dcm(0 is close to 45° when the 
fluorophore is rotating rapidly with little restriction (14). The break in 
graph represents a dark-state transition. In the inset is the autocor- 
relation of the angle parameter (in circles); there are clearly no 
significant temporal fluctuations in TMR rotaUon on the millisecond 
time scale. Also in the inset is the angle parameter autocorrelation for 
pTp-bouhd Cys^ TMR-labeled SNase (squares) together with an 
exponential fit. Here the rotational fluctuations are substantial, with 
a characteristic time constant tr of 96 msec, (c) Histograms of d for 
immobilized TMR-labeled SNase molecules with and without inhib- 
itor pTp. The distribution is narrowly centered at 45" for uninhibited 
SNase, indicative of free and rapid rotation of the attached TMR 
fluorophore. The TMR of inhibitor-bound SNase, on the other hand, 
displays hindered and fluctuating rotational behavior, indicated by the 
broader mean angle parameter histogram; (d) Histogram of rotational 
fluctuation time constants tr for Cy5-labeled SNase. Only those 
molecules that showed single-step photobleaching were included to 
screen out multiply Cy5-labeled cases. The average value of tr is 220 
msec, considerably longer than the majority of E(t) fluctuation time 
constants (average te = 41 msec. Fig. 2c) . 

by SNase without cleavage, or nonspecific interaction between 
SNase and DNA. The value of Tqssoc may not necessarily 
represent the full enzyme-DNA association time because of 
fluorophore rotation and/or photobleaching. When the inte- 
grated acceptor emission is scatter plotted against Tgssoc for 45 
spFRET time traces, the points fall imder a 1/x-shaped curve, 
indicating that Cy5 photobleaching plays a role in shifting the 
values of Tassoc to lower times. 

To investigate the SNase-catalyzed DNA cleavage reaction, 
Tassoc values were extracted from approximately 200 spFRET 
events recorded between D40G SNase and 40-nt ssDNA 
labeled at either the 3' or 5' terminus with CyS. As a control, 
values of Tassoc were also measured using the SNase mutant 
D21Y, which is approximately 10^-fold decreased in activity 
relative to wild-type SNase. Histograms of Tojkoc values for 
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Fig. 4. (a) Emission time trace of a single TMR-labeled wild-type 
SNase molecule interacting with single CyS-labeled 40-nt ssDNA 
molecule(s) (donor emission in squares, acceptor emission in circles). 
The donor signal decreases every time the acceptor signal increases 
(signifying the initiation of energy transfer) and vice versa (signifying 
the termination of energy transfer); this is direct evidence for spFRET. 
(b) Histograms of Ta<«oc for D40G SNase with 5'-Cy5-ssDNA and 
3'-Cy5-ssDNA. The average duration of the FRET signals measured 
from interactions between donor-labeled SNase and 5 '-end acceptor- 
labeled substrate are longer (average = 257 msec) than those mea- 
sured with 3'-end acceptor-labeled substrate (average = 110 msec). 
This difference is consistent with a SNase cleavage mechanism in 
which the 5' cleavage product is released more slowly than the 3' 
cleavage product, or the enzyme catalyzes cleavage processively in the 
3' to 5' direction, (c) Histograms of DNA-SNase association times, 
TiKsoc, for cleavage-impaired D21Y SNase with 5'-Cy5-ssDNA (left; 
average Tassoc = 169 msec) and 3'-Cy5-ssDNA (right; average Tqssoc = 
162 msec). This control experiment demonstrates that fluorophore 
photophysics and statistical aberrations do not account for the differ- 
ences in association times observed when using 5'- and 3 '-end-labeled 
substrates with the cleavage-competent D40G SNase mutant 

D40G and D21 Y SNase are shown in Fig. 4 b and c. Average 
values of Tassoc Were determined by fitting the histograms (bin 
size 200 msec) to a single exponent, assimiing underlying 
Poisson statistics. 

D21Y SNase, with a A;cat of 1.5 X 10"^ sec-^ is virtually 
incapable of catalyzing DNA hydrolysis, although it can bind 
reversibly to DNA with approximately the same affinity as 
wild-type and D40G SNase (K^ ^ 55 fiM (16-21). Because the 
interactions between D21Y and 3' - or 5 '-end-labeled DNA are 
expected to have identical lifetimes in the absence of hydro- 
lysis, this D21Y system can be regarded as a control for the 
intermolecular FRET experiment with D40G SNase. Histo- 
grams of the compiled Tqssoc values are shown in Fig. 4c, and 
the average values of Tqssoc obtained by using the 5'- and 
3'-end-labeled ssDNAs in the latter case (with D21Y SNase) 
are very similar. It is highly unlikely that differences in 
photophysics when using 3'- and 5'-end-labeled DNA are so 
closely offset by differences between D21Y interaction times 
with these DNAs. Hence, this result indicates that there are no 
significant differences in the photophysics between cases 
where the fluorophore is attached to the 3' and 5' termini of 
ssDNA. 
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In contrast, different average values of Tassoc were obtained 
for the cleavage of 3'- and 5 '-end-labeled DNAs by D40G 
SNase (110 msec and 257 msec, respectively) as shown in Fig. 
4Z). Simulation showed that the probability of statistical aber- 
ration alone generating this observed difference in Tqssoc values 
is less than 0.05%, given the number of proteins studied. 
Therefore, these differences in Tasscw may reflect differential 
off-rates of bound 3'- and 5 '-end-labeled cleavage products. 
The results are also consistent with a 3' to 5' processive 
cleavage reaction in which the enzyme sequentially hydrolyzes 
the DNA without dissociation. In such a mechanism, a 3' to 5' 
processive enzyme wotild remain boimd to the 5 '-end-labeled 
cleavage product and retain a FRET signal during strand 
degradation, whereas the 3 '-end labeled fragment would dis- 
sociate after a single cleavage event. Indeed, earlier results 
from our laboratory showed that a 5'-^^P end-labeled 64-nt 
oligodeoxynucleotide was either degraded into small frag- 
ments or left fully intact by wild-type SNase, suggesting a 
processive cleavage mechanism (24). 

In summary, observations of energy transfer and fluoro- 
phore rotation in single-molecule and single-pair systems are 
capable of yielding important insights into protein structure 
and function. The experimental methodology used here is 
general and may be applied to the study of many biological 
processes on a single-molecule level. Further developments, 
including improvements in our ability to label multiple sites 
selectively in a protein, should make it possible to follow a 
variety of biochemical processes with greater precision. 
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